Mixed Observability Predictive State Representations

نویسندگان

  • Sylvie C. W. Ong
  • Yuri Grinberg
  • Joelle Pineau
چکیده

Learning accurate models of agent behaviours is crucial for the purpose of controlling systems where the agents’ and environment’s dynamics are unknown. This is a challenging problem, but structural assumptions can be leveraged to tackle it effectively. In particular, many systems exhibit mixed observability, when observations of some system components are essentially perfect and noiseless, while observations of other components are imperfect, aliased or noisy. In this paper we present a new model learning framework, the mixed observability predictive state representation (MO-PSR), which extends the previously known predictive state representations to the case of mixed observability systems. We present a learning algorithm that is scalable to large amounts of data and to large mixed observability domains, and show theoretical analysis of the learning consistency and computational complexity. Empirical results demonstrate that our algorithm is capable of learning accurate models, at a larger scale than with the generic predictive state representation, by leveraging the mixed observability properties.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transfer from Multiple Linear Predictive State Representations (PSR)

In this paper we tackle the problem of transferring policy from multiple partially observable source environments to a partially observable target environment modeled as predictive state representation. This is an entirely new approach with no previous work, other than the case of transfer in fully observable domains. We develop algorithms to successfully achieve policy transfer when we have th...

متن کامل

Rcd Rules and Power Systems Observability

Power system state estimation is a process to find the bus voltage magnitudes and phase angles at every bus based on a given measurement set. The state estimation convergency is related to the sufficiency of the measurement set. Observability analysis actually tests this kind of problem and guarantees the state estimation accuracy. A new and useful algorithm is proposed and applied in this pape...

متن کامل

Observability-Enhanced PMU Placement Considering Conventional Measurements and Contingencies

Phasor Measurement Units (PMUs) are in growing attention in recent power systems because of their paramount abilities in state estimation. PMUs are placed in existing power systems where there are already installed conventional measurements, which can be helpful if they are considered in PMU optimal placement. In this paper, a method is proposed for optimal placement of PMUs incorporating conve...

متن کامل

Model Predictive Control - Ideas for the next Generation

Mixed Logical Dynamical (MLD) systems are introduced as a new system type. The MLD form is capable to model a broad class of systems arising in many applications: linear hybrid systems; sequential logical systems (finite state machines, automata); nonlinear dynamic systems, where the nonlinearity can be expressed through combinational logic; some classes of discrete event systems; constrained l...

متن کامل

Modelling Sparse Dynamical Systems with Compressed Predictive State Representations

Efficiently learning accurate models of dynamical systems is of central importance for developing rational agents that can succeed in a wide range of challenging domains. The difficulty of this learning problem is particularly acute in settings with large observation spaces and partial observability. We present a new algorithm, called Compressed Predictive State Representation (CPSR), for learn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013